A generalization gap estimation for overparameterized models via the Langevin functional variance

نویسندگان

چکیده

This paper discusses the estimation of generalization gap, difference between performance and training performance, for overparameterized models including neural networks. We first show that a functional variance, key concept in defining widely-applicable information criterion, characterizes gap even settings where conventional theory cannot be applied. As computational cost variance is expensive models, we propose an efficient approximation function Langevin (Langevin FV). method leverages only 1st-order gradient squared loss function, without referencing 2nd-order gradient; this ensures computation implementation consistent with gradient-based optimization algorithms. demonstrate FV numerically by estimating gaps linear regression non-linear network containing more than thousand parameters therein.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Estimation of Variance Components for Body Weight of Moghani Sheep Using B-Spline Random Regression Models

The aim of the present study was the estimation of (co) variance components and genetic parameters for body weight of Moghani sheep, using random regression models based on B-Splines functions. The data set included 9165 body weight records from 60 to 360 days of age from 2811 Moghani sheep, collected between 1994 to 2013 from Jafar-Abad Animal Research and Breeding Institute, Ardabil province,...

متن کامل

Estimation of Time Series Models via Robust Wavelet Variance

A robust approach to the estimation of time series models is proposed. Taking from a new estimation method called the Generalized Method of Wavelet Moments (GMWM) which is an indirect method based on the Wavelet Variance (WV), we replace the classical estimator of the WV with a recently proposed robust M-estimator to obtain a robust version of the GMWM. The simulation results show that the prop...

متن کامل

a new approach to credibility premium for zero-inflated poisson models for panel data

هدف اصلی از این تحقیق به دست آوردن و مقایسه حق بیمه باورمندی در مدل های شمارشی گزارش نشده برای داده های طولی می باشد. در این تحقیق حق بیمه های پبش گویی بر اساس توابع ضرر مربع خطا و نمایی محاسبه شده و با هم مقایسه می شود. تمایل به گرفتن پاداش و جایزه یکی از دلایل مهم برای گزارش ندادن تصادفات می باشد و افراد برای استفاده از تخفیف اغلب از گزارش تصادفات با هزینه پائین خودداری می کنند، در این تحقیق ...

15 صفحه اول

Variance Estimation for Tree Order Restricted Models

In this article we discuss estimation of the common variance of several normal populations with tree order restricted means. We discuss the asymptotic properties of the maximum likelihood estimator of the variance as the number of populations tends to infinity. We consider several cases of various orders of the sample sizes and show that the maximum likelihood estimator of the variance may or m...

متن کامل

A Non-degenerate Estimator for Variance Parameters in Multilevel Models via Penalized Likelihood Estimation

In this paper we consider the problem of obtaining zero estimates for group-level variance parameters in multilevel or hierarchical linear models, a problem that occurs frequently when the number of groups or the value of the variance is small. For situations where zero variances are implausible a priori, we propose a maximum penalized likelihood approach to avoid such boundary estimates. This ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Computational and Graphical Statistics

سال: 2023

ISSN: ['1061-8600', '1537-2715']

DOI: https://doi.org/10.1080/10618600.2023.2197488